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Abstract 

We analyze the mean cost of the partial match queries in random two- 
dimensional quadtrees. The method is based on fragmentation theory. The 
convergence is guaranteed by a coupling argument of Markov chains, whereas 
the value of the limit is computed as the fixed point of an integral equation. 
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1. Introduction 

Introduced by Finkel and Bentley [3] , the quadtree structure is a comparison based 
algorithm designed for retrieving multidimensional data. It is often studied in computer 
science because of its numerous applications. The aim of this paper is to study the 
mean cost of the so-called partial match queries in random quadtrees. This problem 
was first analyzed by Flajolet et al. [B]. 

Let us briefly describe the discrete model. We choose to focus only on the two- 
dimensional case. Let P\, . . . , P n be n independent random variables uniformly dis- 
tributed over (0, l) 2 . We shall assume that the points have different x and y coor- 
dinates, an event that has probability 1. We construct iteratively a finite covering 
of [0,1] 2 composed of rectangles with disjoint interiors as follows. The first point Pi 
divides the original square [0, l] 2 into four closed quadrants according to the vertical 
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and horizontal positions of P\ . By induction, a point Py. divides the quadrant in which 
it falls into four quadrants according to its position in this quadrant, see Fig. 1. Hence 
the n points Pi, . . . ,P„ give rise to a covering of [0, l] 2 into 3n + 1 closed rectangles 
with disjoint interiors that we denote by Quad(Pi, . . . , P n ). 




FIGURE 1: Two splittings of [0, l] 2 with resp. 8 and 100 points. 



We are interested in the partial match query. As explained by Flajolet and Sedgewick 
Example VII. 23.], given xq £ [0, 1], it determines the set of points Pi, i € {1, . . . , n}, 
with x coordinates equal to xo, regardless of the y coordinates (that set is either empty 
or a singleton). Denoting the vertical segment [(x, 0), (x, 1)] by S x , the cost of this par- 
tial match query is measured by the number Af n (x) of rectangles of Quad (Pi, . . . , P n ) 
intersecting S x minus 1 (Ao(i) = by convention). Our main result is: 

Theorem 1. For every x € [0, 1], we have the following convergence: 
n-P*E[Af n (x)] — > K (x(l-x)) r/ \ 

L n— >-cso 

, fl * 717-3 r(2/r + 2)r(/r + 2) 

where p = ana Kq — -. r-. 

2 2r 3 (/3* + l)r 2 f^ + lj 

Flajolet et al. 6j obtained the convergence in mean of the cost of partial match queries 
when x is random with the uniform law on [0, 1] and independent of Pi, . . . , P„. We 
shall give another proof of this result using fragmentation theory. As a by-product 
of our techniques, we shall prove in Corollary [2] below that when rescaled by n 1- ^ 2 , 
J\f n (0) converges in L 2 (its convergence in mean was obtained in [B]). 

The paper is organized as follows. Section [2] introduces the model embedded in 
continuous-time and presents the first properties. Section [3] is devoted to the link 
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between quadtrees and fragmentation theory. Section [4j the most technical one, 
contains the proof of the convergence at a fixed point x without knowing the limit. 
The identification of the limit is done in Section [5] using a fixed point argument for 
integral equation. 

Acknowledgement. We would like to express our gratitude to Philippe Flajolet 
who introduced us to the problem of partial match query. We are indebted to Nicolas 
Broutin and to Ralph Neininger for fruitful discussions. We also deeply thank Jean 
Bertoin for his careful reading of the first versions of this work. 

2. Notations and first properties 

In order to apply probabilistic techniques, we first introduce a continuous-time 
version of the quadtree: the points Pi , . . . , P n are replaced by the arrival points of 
a Poisson point process over K + x [0, l] 2 with intensity dt <g> dxdy. All the results 
obtained in this model can easily be translated into results for the discrete-time model. 

2.1. The continuous-time model 

Let IT be a Poisson point process on K + x [0,1] 2 with intensity dt ® dxdy. Let 
((Ti,Xi,yi),i > 1) be the atoms of II ranked in the increasing order of their r-componcnt. 
We define a process (Q(i))t>o with values in finite covering of [0, l] 2 by closed rectangles 
with disjoint interiors as follows. We first introduce the operation SPLIT: for every 
subset R of [0, l] 2 and for every (x, y) £ [0, l] 2 , 

SPL\T(R,x,y) = {RD[0,x] x [0, y], Rn [0, x] x [y,l],Rn[x,l] x [Q,y],Rn[x,l] x [y,l]}. 

In other words, if R is a rectangle with sides parallel to the x and y axes, then 
SPLIT(_R, x, y) is the set of the four quadrants in R determined by the point (x,y). 
We may now recursively define the process (Q(£))t>o- Let To = 0. For every t G [0, T\), 
define Q(i) = {[0, l] 2 }, and for every t £ [r^Ti+i), denoting by R the only element (if 
any) of Q(r,_i) such that (xi,yt) is in the interior of the rectangle i?, let 



Q(t) - SPLIT(ii, Xi, yC) U Q (rj-i) \ {R}. 
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Observe that a.s., for every i E Z + , there indeed exists a unique rectangle of Q(rj) such 
that (x i+ i,y i+ i) is in its interior, hence the process (Q(i))t>o is well defined up to an 
event of zero probability. In the sequel we shall assume that the points of II always fall 
in the interior of some rectangle of (Q(t))t>o- As explained in the introduction, we are 
interested in the number of rectangles of Q(t) intersecting the segment S x , specifically 
we set: 

JV t (a;) = #{i?eQ(i) :RnS x ?Q}-l, 

so that N t (x) = for every < t < t-\_. Recalling that r„ is the arrival time of the n-th 
point of II, Q(r„) has the same distribution as the random variable Quad(Pi, . . . , P n ) 
of the introduction. In particular, for every (n, x) e N x [0, 1], we have N Tn (x) — Af n (x) 
in distribution. 

2.2. Main equations 

Let x G [0, 1]. We denote by A the set of words over the alphabet {0, 1}, 

A= \J{0,l} n , 

n>0 

where by convention {0, 1}° = {0}. Thus, if u £ A, u is cither or a finite sequence 
of and 1. If u and v are elements of A then uv denotes the concatenation of the 
two words u and v. We label the rectangles appearing in (Q(t))t>o whose intersection 
with the segment S x is non-empty by elements of A according to the following rule. 
By convention R (x) is the unit square [0, l] 2 . The first point (n,xi,yi) of Id splits 
[0, l] 2 into four rectangles, a.s. only two of them intersect S x , we denote the bottom 
rectangle by Rq(x) and the top one by Ri(x). Inductively, for every u € A, a point of 
II eventually falls into R u (x), dividing it into four rectangles. Almost surely, only two 
of them intersect S x , denote the bottom one by R u q{x) and the top one by R u \(x). 
For u e A, we denote the minimal (resp. maximal) horizontal coordinate of R u {x) by 
G u (x) (resp. D u (xj), and define the place of x in R u (x) to be 

u[ ' D u (x)-G u (xY 
If u 7^ 0, wc denote the parent of u by tz~ which is the word u without its last letter. 
We write M u (x) for the ratio of the (two-dimensional) Lebesgue measure Leb(i?„(a;)) 
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of R u {x) by the measure of R\{x), 

= Leb^)) 

We also set for all x e [0, 1], M (x) — 1. For u £ {0, 1} and t > 0, we introduce the 
"subquadtree" Q u ,x(t) = {R £ Q(t + ri) : R C i? u (a;)}. Then, for every t > 0, one has: 

JV t (s) = i t > ri + i 4 > Tl J2 (f{ReQ u , x (t-n):RnS x ^<D}-i). (l) 

ue{o,i} 

If R is a rectangle with sides parallel to the x and y axes, we denote by : 
M 2 — > R 2 the only affine transformation that maps the bottom left vertex of R to 
(0,0), the bottom right vertex of R to (1,0) and the up left vertex of R to (0,1). 
It should be plain from properties of Poisson point measures that, conditionally on 
(M u (x),X u (x),Ru(x)), the process (&R u ( x )(Qu,x(t)))t>o has the same distribution 
as the process {Q(M u (x)t)) t >o, where Q is an independent copy of Q. In particu- 
lar, conditionally on (M u (x), X u (x)), the number of rectangles in Q UjX that intersect 
S x (minus 1), viewed as a process of t, has the same distribution as the process 
(NM u (x)t(X u (x)))t>o where N is defined from Q is the same way as N is defined 
from Q. Since Mq(x) and Mi{x) have the same distribution, ([T]) yields 



E[N t (x)} = P(i> n )+2E N Mo{x) ^ Tl) (X (x)) 



(2) 



with the convention N t (x) = whenever t < 0. More generally, if we write ik G A for 
3ft = ... repeated k times, then for every positive integer fc, 

E[N t (x)] = g k (t) + 2 k E[N Msiixh ,, Mskix)t _ Fk (X ik (x))\ , (3) 

where g k is a function such that < g k < 2 k — 1 and Fk is a nonnegative random 
variable defined by 

k k 

Fk - ' : ' r; - 

»=1 3=i 

with (fj)i>i a sequence of independent exponential variables with parameter 1. 

We know compute the joint distribution of (Mq(x), Xq(x)) which will be of great 
use throughout this work. If / is a nonnegative measurable function, easy calculations 
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yield 



E 



f(M (x),X (x)) 



i ,1 / 
du / dv l a 
Jo \ 



. ( x\ ( . x — u\ 

CuJ \ UV, -J + 1 X>U J I (1 - U)V, — — I 

dy 



i-s 



— / dm/(m, y) + / — — / dm/(m, y) (4) 
dm / — f(m,y) + f dm 

o Jx v J 

f x dy 

dm / f(m,y)+ / dm / -^-/(ro,j$) 

o Jo 1 ~ V Ji-x Jl-i=* i - y 



dy 



f{m,y) 



dy 



2.3. Depoissonization 



The following lemma contains a large deviations argument that will enable us to 
shift results from the continuous-time model to the discrete-time one. 

Lemma 1. For every e > 0, we have 



E 



SU P \ N r n {x) - N n (x)\ l r „£[n(l- e ),n(l+e)] 

ce[o,i] 



0. 



Proof. Note that for every x g [0, 1], 1 1 — > N t (x) is non-decreasing and that N t (x) is 
at most the number of points fallen so far: N t (x) < max{i G Z + : Tj < t}. In particular 
•^Vr„ (^O < n ; thus we have 

sup \N Tn (x) - iV„(x)| 2 l Tn>n(1+£) < n 2 l Tn>n[1+e) . 

xe[o,i] 

A large deviations argument ensures that n 2 P(r„ > n(l + e)) tends to as n — > oo. 
On the other hand, applying the Cauchy-Schwarz inequality, we obtain 



E 



SU P \ N r n (x) - N n (x)\ l TB <„(l +e ) 

xe[o,i] 



< [(max{i i Z+ : n < n}) 4 ] JP(r„ < n(l - e)). 



As E[(max{i g Z + : < n}) 4 ] = 0(n 4 ), large deviations ensure that the quantity in 
the right-hand side tends to as n — > oo. Finally, Lemma [I] is proved. 



3. Particular cases and fragmentation theory 

We give below the definition of a particular case of fragmentation process. For 
more details, we refer to [T]. Let v be a probability measure on {(s\,S2) : s\ > 
S2 > and s\ + S2 < 1}. A self-similar fragmentation (^t)t>o with dislocation 
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measure v and index of self-similarity 1 is a Markov process with values in the set 
= {(si,S2,...) : si > S2 > • • • > and Sj < 1} describing the evolution 
of the masses of particles that undergo fragmentation. The process is informally 
characterized as follows: if at time t we have ,^{t) = (si(i), S2(i), . . . ), then for every 
i > 1, the i-th "particle" of mass s^(i) lives an exponential time with parameter Sj(i) 
before splitting into two particles of masses r\Si(t) and r2Si(t), where {r\,r2) has 
been sampled from v independently of the past and of the other particles. In other 
words, each particle undergoes a self-similar fragmentation with time rescaled by its 
mass. In the next section we establish a link between fragmentation theory and the 
process N t (U), where U is a r.v. uniformly distributed over [0, 1] and independent of 
(Q(i))t>o- This connection will provide a new proof of a result of [6] and [3]. See also 
[1] for another recent application of fragmentation theory to a combinatorial problem 
where the exponent ^^~ 3 appears. 

3.1. The uniform case 

We consider here the case where the point x is chosen at random uniformly over 
[0, 1] and independently of (Q(£))t>o- 

Proposition 1. Let U be a random variable uniformly distributed over [0, 1] and inde- 
pendent of the quadtree (Q(£))t>o- Let u 6 A and denote by Uq = , ui, • • • , = u its 
ancestors. Then X U (U) is uniform over [0,1] and independent of (M Ul (U), M Uk (U)), 
which is a sequence of independent random variables all having density 2(1— TO)l m e[o,i] • 

Proof. We prove Proposition [l] by induction on k. Let u G A. Denote by uq = 
0,11%, ... ,Uk = u its ancestors. Integrating Q for x e [0, 1], we deduce that for every 
V E {0, 1}, X V (U) and M V (U) are independent and distributed according to 

l«e[o,i]du ® l m£ [ ,i]2(l - m)dm. (6) 

Recalling that Q Ul ,u(t) = {R e Q(t+n) : R C R Ul (U)}, conditionally on (X Ul (U), M Ul (U)), 
the process &r u1 (u)(Qu 1 ,u) has the same distribution as (Q(M Ul (U)t))t>o, where Q 
is an independent copy of Q. Since X Ul (U) is uniform over [0,1], we deduce by 
induction on the subquadtree Q Ul ,u that X U (U) is uniform over [0, 1] and independent 
of (M U2 (U), . . . ,M Uk (U)) which is a sequence of independent r.v. all having density 
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2(1 — ?7i)l m g[o.i] ■ Furthermore it is easy to see that 

E [(X Ui (U), M Ui (U)) 2 <i< k | (X U1 (U),M U1 ([/))] = E [{X Ui (U),M Ui (U)h< z < k \X Ul (U) 

Hence by §6§, X U (U) is also independent of M Ul (U). 

Letting m(t) = E[N t (U)], (recall that when t < 0, N t (x) = for all x G [0,1]) 
equation ^ becomes 

m(t) = P(*>n)+2E[m(M(t-ri))], (7) 
where M is independent of T\ and has density 2(1 — m)l m£ [o,i]. 

Proposition 2. Let U be uniform over [0, 1] and independent of (Q(i))t>o- We have 
the following convergence 



Proof. We consider an auxiliary fragmentation process (^t)t>o with index of self- 
similarity 1 and dislocation probability measure v given by 

v(ds 1 ,ds 2 )f(si,s 2 ) = E[f(M 1 (U)vM (U),M 1 (U)AM (U)) 

In other words, the dislocation measure is given by the law of the decreasing ordering 
of {Mo(U), Mi(U)}. More precisely (J^t)t>o takes its values in and satisfies the 
following equation in distribution which completely characterizes its law: 

= ((lt<r) + (WW) • *Zw-r)) t>0 + {^MlM ■ ^£W-r)) t>0 ) ' 

with (^j°^)t>o and )t>o two independent copies of (J^t)t>o also independent of 

(Mq(U), Mi(U), t) and r an independent exponential variable with parameter 1. The 
symbol + means concatenation of sequences and (.)^ is the decreasing reordering (and 
erasing of zeros). Then, it is straightforward to see that the expectation of the number 
#J?t of fragments of & t minus 1 satisfies the same equation as E[7V t ([/)], namely letting 
m(f) = E[#J?f - 1] for t > 0, and m(i) = for t < we have 

m(t) = P(*>n) + 2E[m(M(t-n))], (8) 
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where M is independent of T\ and has density 2(1 — Ti)l TO e[o,i]- By Q and (JSJ) , the 
functions m and m satisfy the same integral equation, 

f(t) = l-e~ t + 2 f dm2(l-m) / ds e _s /(m(i - s)). 
Jo Jo 

Differentiating with respect to i, we see that both m and m are solutions of the Cauchy 
problem for the integro-differential equation 

dtf(t) = 1 - f(t) + I dm2(l-m)/(mt), 
/(0) = 0. 



o 



Uniqueness of solution of this kind of integro-differential equation is known, see e.g. [5]. 
We deduce that for every t > 0, m(f) = m(t). We now focus on m(t). Following [21 
Section 3], we let for every B > 0, ?/>(/3) = 1 — / f(dsi) dsaXsf +sf)- An eas y calculation 
yields: 

/rm /? 2 + 3/3 - 2 

W) = 0J + D0J + 2)- 

In particular the Malthusian exponent associated to z/, which is characterized by — 
(see [2 Section 1.2.2]), is 

B* = . 

H 2 

Applying [2, Theorem 1], we get: 

T{l-B*) 4 ^ / /3*\ / 8* W /3* 



Finally, we use the Weierstrass identity for the gamma function: for every complex 
number z G C \ Z_ , 

r(z + l) = e-T 2 [](l + ^) _ e z / fe , 
fe=l ' 

where 7 is the Euler-Mascheroni constant. We conclude that 



iim t-rEW (U)] = 4 r(Vl7 + l) 1 1 = r(2Q3* + l)) 

1 V yJ /3*(2^*+3)r(yi7-/3* + l)r 2 (/3*+2)l + /3*/2 2T 3 (B* + 1) : 

which completes the proof of the proposition. 

Remark 1. One can derive the following equality in distribution from (JlJ: 
N t (U) ® In^t + ^Sfroct-nj^o^+JvS^^CX!^)), 
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where (iV^ ) t >o and (N^) t > are independent copies of the process (N t ) t > . We have 
already noticed that Xq(U) and X\(U) are also uniform and independent of (iVj ) t >o, 
of (iV t (1) )i> and of (M (U), Ah{U)). If X (U) and Xi(U) were independent, then 
N t {U) would satisfy the same distributional equation as — l)t>o- However, this 

is not the case since we have Xq{U) = X\{U). This explains why we had to work with 
expectations. 

Corollary 1. ([6], [3j.) We have 

,to,.-»-E[K( t / ) ] = §^tai. 

Proof. This is a straightforward application of Lemma [T] and Proposition [2] 
3.2. Case a; = 

As a further example of the connection with fragmentation theory, we derive asymp- 
totics properties for iV t (0). In this case, the sequence of the areas of the rectangles 
crossed by So is a fragmentation process, enabling us to state a convergence of A^(0), 
once rescaled, in L 2 . A convergence in mean has already been obtained in (6j Theorem 
6] and [7J. 



m = £ Leb(i? u (0))^ X l Ru{0 )eQ(t), t > 0, 



Theorem 2. The random variable 

v V2-i, 
ueA 

is a uniformly integrable martingale which converges almost surely to SJlca as t — > oo. 
The distribution of StJtoo is characterized by 

E[aJtoo]=l and MoiO^M^ +M 1 (o)^- 1 mg\ (9) 

where VJl^S anddJl^} are two independent copies o/SJtoo also independent o/(Mo(Q), Mi(0)). 
Furthermore, we have the following convergence in L 2 : 

^ V2r3(V2) °°- 

Proof. It is easy to check from properties of Poisson measures that the rearrangement 
in decreasing order of the masses of the rectangles living at time t and intersecting Sq , 

t>o 



(Leb(i? u (0))l i?i 



(o)eQ(t; 
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is a self-similar fragmentation with index 1 and dislocation probability measure given 
by the decreasing ordering of {Mq(Q), Mi(0)}. As in the proof of Proposition^ we 
introduce for every ft > 0, = 1 — E[M o (0)^ + Mi(0) /3 ], which is easily computed: 

[P1 (/3 + 1) 2 

Thus the Malthusian exponent p* of this fragmentation satisfying ^(p*) = is 

p* = V2- 1. 

The first two points of the theorem follow from classical results of fragmentation theory, 
see [TJ Theorem 1.1]. We refer to [TU] for the characterization of the law of 9Jtoo via 
the distributional equation ^ and to [11] for some of its properties. The last point 
conies from [2] Corollary 6] and the Weierstrass identity for the gamma function used 
in a similar manner as in the proof of Proposition [2j 

Corollary 2. We have the following convergence in L 2 : 

n^N (0) — > F(2 ^ M 

Proof. This proposition easily derives from Lemma [l] and Theorem [2] 
Remark 2. Observe that Corollary[2]implies the following convergence in distribution: 

Remark 3. It is worthwhile to notice that the behavior of the cost of the partial match 
query in the case x = is drastically different from its behavior in the case when x is 
uniform or x is fixed in (0,1) (see Theorem [l] and Proposition [2]) . 

3.3. An a priori uniform bound 

This section is devoted to the proof of an a priori uniform bound on s - ' 3 E[A 4 (x)] 
over (x, s) £ (0, 1) x (0, oo) that will be useful in many places. 

Lemma 2. There exists C < oo such that 



sup sup E 

xG(0,l) s>0 



s~ p *N s (x) 



< C. (10) 
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Proof. As a warmup, we start by proving that there exists C\ < oo such that for 
every x £ (0, 1), 

Ci 



sup E s -P"N s (x) 
s>o L 



< 



(11) 



x A (1 - x) 

Combining |2]) with the densities computed in Q, we deduce that for every x € (0, 1) 

^dy 



t- p E[N t (x)] = t 



>T1 



y Jo 



dmE 



t~P' N m(t _ Tl) (y) 



x dy 



1 — 03 



dmE 



/ i / ■■ — "" ■ ' i (1^) 

lo 1 — y Jo L J ) 

By monotony of t i-> AT t (a;) we have E [t' 13 " A m(t _ Tl) (y)] < E [i^* A t (y)] . Further- 
more, recalling that (3* < 1, there exists a constant C" such that for every £ > 
t~^¥(t > ri) < C". Hence 



r r E[AT t (x)] < C" + 2 
< C" 

= c" 



1 xdy 
2 



E 



t- p *N t (y) 



x (1 - x)dy 

(i-y) 2 



E 



a; A (1 — x) J 
2 



dyE 



t~ 'N t (y) 



E[t- *N t ([/)]. 



a; A (1 - x) 

It has been shown in Proposition [2] that E [t-^*iVt(J7)] has a finite limit as £ — > 00, 
and for every t > 0, E [JV t (t7)] < i. Thus the quantity E [t~P* N t (U)] is bounded over 
(0,oo). The inequality (11) follows from these considerations. 

Introducing S(x) — sup s>0 E[iV a (x)] for every x € [0,1], we have just shown 
that S(x) < d(x A (1 - a:)) -1 . Using Q, we have for every x E (1/2, 1): 

L y Jo 



S(x) — sup < t 
t>o 



>rx) + 2 



dmE 



t~ P N m(t _ Tl) (y) 



x dy 



1 -yJo 



< C" + 2sup 



dmE 
l dy f 1 



y Jo 



dmE 



m(t-Tx){y) 

t~ P 'N t (y) 



1/2 



dy 
1 ~V Jo 



dmE 



t- p 'N t (y) 



-2 sup 



x dy 



t>o J1/2 1 - y Jo 



dmm" E 



< C" + 8sup / dyE N t (y) 
t>o Jo L 



{mt)-' r N mt {y) 
x dy 



< C 2 + — (1-x) 



i/2 dy (i-yr +2 



1/2 1 - y Jo 
S(y). 



dmmr S(y) 



1 



(13) 
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Let us show that this implies that for every x G (0,1), S(x) < IOOC2. Arguing 
by contradiction, suppose that there exists a G (1/2,1) such that 5(a) > IOOC2. 



Let 5 — S(x). By (11), 5 is finite; there exists 6 G [1/2, a] such that 

5(6) > 0.95. In particular, 5(6) > 0.9 sup x£[1/2jb] S(x) and 5(6) > 90C 2 . Applying 
(13) at 6, we get 

5(6) < 90^5(6) + ^(1 - bf^ J" dy {i - ^, +2 0.9^5(6) 



< 90" 1 5(6) + IQm ^ 2 5(6), 



2-0.9" 1 

W + 1) 

leading to a contradiction since 03* + l) 2 > f^r. Finally, S(x) < 100C 2 for every 
x € (0,1). 



4. The convergence at fixed x 6 (0, 1) 

We prove in this section that when x E [0, 1] is fixed, E[N t (x)] admits a finite 
limit as t — > 00. The results of the preceding section do not directly apply since the 
place Xq(x) of x in the rectangle Rq(x) highly depends on x. Recall notation ik for 
the word composed of k zeros 0...0 6 A The guiding idea is that the splittings tend 
to make X ik (x) uniform and independent of M ih (x). 

4.1. A key Markov chain 

Fix x € (0,1). To simplify notation, for every k > 1, we write Xk for X ik (x) 
and Mk for M, k (x). We shall focus on the process (Xk, Mk)k>o, which is obviously a 
homogeneous Markov chain starting from (x, 1) whose transition probability is given 
by Q or ([5]). Let k > 1. We denote by Tk the filtration generated by (Xi, Mj)i<j<fc. 
It is easy to see that the transition probability only depends on Xk, that is 

E[(X fc+is M fc+i ) i > 1 |Jifc] =E[(X fe+i ,M fe+i ) i > 1 |X fe ]. 

Proposition 4.1. Fix x € (0,1). There exists a coupling of the chain (Xk, Mk)k>o 
with a random time T G Z + smc/i t/iat for any k > 0, conditionally on {T < /c}, 
t/ie r.u. Aj. is uniformly distributed over [0,1], independent of (Mj)i<j<fe and 0/ T. 
Furthermore, we have 

E [1.15 T ] < +00. 
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Proof. For any k > 1 we consider the event 

E k = {M k < X k _ x A (1 - X k -i)} . 

Using the explicit densities Q and one sees that conditionally on J- k -\ and on 
the event E k of probability — (X k _i A (1 — X k _i)) ln(Xj._i(l — X k _x)), the conditional 
distribution of Xk is 

-\n(X k Jl-X k _ l} ) (rh; W^x) + d». 

In particular, conditionally on and J- k -i, the variable is independent of Mj, and 
has a density bounded from below by — l/ln(Xfc_i(l — X k -\)). Thus, we can construct 
simultaneously with (X k , M k )k>o a sequence of random variables (B k )k>o € {0, 1} Z+ as 
follows. Suppose that we have constructed (X^, M iy i?i)o<i<fc— l ■ Then independently of 
J-fc_i, toss a Bernoulli variable of parameter — (X k -i A (1 — X k -i)) ln(JCfe_i(l — X k _i)). 
If comes out, we consider that we are on the event EZ, then put B k = and sample 
(X k , M k ) with the conditional distribution on E k and T k -\- If 1 comes out, we consider 
that we are on the event E k and we proceed to the following. 

1. First sample Mk from its distribution conditionally on E k and T k -\. 

2. Then independently of M k , toss a Bernoulli variable B k of parameter — l/ln(Xfc_i(l— 
Xk-ij). If Bk = 1, sample X k uniformly from [0,1] and independently of 
(Mi, . . . , M k ). Otherwise, sample X k with density 

independently of (Mi, . . . , M fe ). 

The device provides us with a Markov chain (X k , M k , B k ) k >o such that the first two 
coordinates have the law of the process introduced before Proposition |4.1| We then let 

T = inf{fc > 0,B k = 1}. 

By definition of T, the random variable Xj> is sampled uniformly over [0, 1] and 
independently of (Mi, . . . , Mp). We deduce that the process (Xt+i, Mr+i)i>i has the 
same distribution as the process (X ik (U), M ik (U)) k >i defined in Proposition [11 hence 
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an easy adaptation of Proposition [I] shows that for every positive integer i, Xx+i is 
uniformly distributed over [0, 1] independent of (Mi, . . . , Mx+i) and of T. This proves 
the first part of Proposition |4.1| 

For the second part, we need to evaluate the tail of the random time T. We introduce 
the following variation. Let {Xf.)k>o be a Markov chain with space state (0, 1) U 
{$}, where d is a cemetery point. Informally, this chain is the chain (X k ) until we 
reach the time T, then it is killed and sent to the cemetery point. Thanks to the 
calculation presented at the beginning of the proof, it should be clear that given Xk-i 
and conditionally on {T > k — 1}, the probability of the event {T = k} is Xk-i A 
(1 — Xk-i)- Thus the transition probability for the chain (Xk) is defined as follows: for 
every x € (0, 1), 

1-x 

and p(d,dy) = Sq. By construction of this chain, the stopping time T = inf{/c > 1 : 
Xk = d} has the same distribution as T. In order to estimate T, we define the following 
potential function V : (0, 1) U {d} — > [l,oo]: 

. 10 10 

V(X) = l x =d + —j= lag (0,1/2) + n — lzg[l/2,l)- 

V x \J I — X 

Then one can show that for every x £ (0, 1) U {d}, 

p(x,dy)V(y) < 0MV(x) + l {d} (x), 
so that [TSJ Theorem 15.2.5] may be applied: there exists e > such that for all 

xe (o,i), 



p(x, Ay) 



(1 ■ — X X \ 

(X _ y y 1 y£(o,x) + ^veix,!) -xA(l-x)\ dy, 



E 



T-l 



1.15* 



fe=0 



< E^l.l^Vix), 



from which we deduce that 



E 



1.15 J 



< oo 



(note that the last quantity is not uniformly bounded for x 6 (0, 1)). This completes 
the proof of Proposition |4.1| 

In the remaining part of this section, x is fixed in (0, 1). Coming back to ^ and 
writing M k = M1M2 ■ ■ ■ M k for the Lebesgue measure of R lk (x), we have 



E[N t (x)]=t-r(g k {t) 



■ 2 E 



N- 



M k t-F, 



{x k )i 



T>k 



2 k E 



N- 



Mut-F, 



{X k )l 



T<k 



(14) 
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We shall treat separately the last two terms of (14). 



4.2. Study of t-P'2 k E[N Mkt _ Fk (X k )l T>k ] 

We shall see that 2 k E[N-^ kt _ Fk {X k )lT>k] is arbitrarily small uniformly in t 
provided that the integer k is chosen large enough. Observe 

t-^2 k E[N^ kt _ Fk (X k )l T>k ] < t-^2 k E[NjjjX k )l T>k ] 



2 k E 



2 k E 



Aff (M k t)-^N mt (X k )l T>k 



Aff l T>fc E 



(M k t)- p ' NjgJXk) a(M k ,X k ,T) 



Letting tfi be the map (s,u) >-> E[s " N a (u)], we have: 

t~P'2 k E[N JIkt _ Fk (X k )l T>k ] < 2 k E\Ml'l T>k (j ) (M k t,X k ) 



Thanks to ( 10 ),</>< C, so that the quantity in the last display is at most C2 k E 



M' k 1t>a 



Holder's inequality yields for every p > 1 



C2 k E 



M k l T >k 



- C2 k E 



M 



i/p 



E\l 



T>k\ 



il-l/p 



The last term is easily treated, by Markov's inequality we have E [lr>fc] < 1.15 fe E[1.15 T ]. 

— P*p 

Concerning E[M fc ] we have 



E 



< E 



M i2 (xf**...M u (xf'r 

&\y)dyE [M il ( v r p ---M lk _ 1 {vf' P 

where is the density of Xi under P. It is easy to see from Q that is bounded 
from above by (x A (1 — Hence 



E 



M 



< /TTT i / ^\M k - 1 (yf' P 

xA{l-x)J L 



Recall from Proposition [I] that when x — U is uniform over [0, 1] and independent 
of (Q(i)){>o, then M }1 (U), . . . , M ik (U) are independent and distributed according to 
lm£[o,i]2(l — m)dm. In particular 

2 



E 



M (CO 



{f3* P +l){p*p + 2) 



and thus 



fc-i 
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Gathering all these estimates, we obtain 



t-^2 k E[Njj kt _ Fk (X k )l T>k ] 



< C2 k 



1 



x A (1 — x) 
2 



i/p 



(k-i)/ P 



E[l.l5 T ] 1 ~ 1/p 1.15- fe ( 1 - 1 /*') 



2 V lp V 

(/3*p+l)(/?*p + 2)J ■ J ' 

where -ftT^a, is a constant that only depends on p and x but on k. Now, one can easily 
prove that for p > 1 sufficiently close to 1, the term between brackets in the last display 
becomes strictly less than 1. Consequently, letting e > fixed, there exists an integer 



k sufficiently large such that for every t > 0, 

t-e*2 k E[N Mkt _ Fk (X k )l T>k ] < s. 

4.3. Conclusion 

Observe that we have for every t > 



(15) 



t~ p '2 k E 



N mt . Fk (X k )l 



T<k 



2 k E 



= 2 fc E 



l-T<k 



E 



M k t-F k ( X k 



a(M k ,F k ,T) 



l T <k{M k - r L F k ) p + E (M k t - F k )-/' 'N Mht _ Fk (X k ) a(M k ,F k ,T) 



where y+ denotes y V 0. By Proposition 4.1 on the event {T < fc}, the r.v. X k is 



uniformly distributed over [0, 1] and independent of Mi , . . . , M k thus of M k . It is also 
independent of F k and T. Hence, letting 9 be the map s h- > E^^ N S (U)], where U is 
a random variable uniformly distributed on (0,1) independent of N, we have: 



t~ p *2 k E 



Njj t _ F , (X k )l T <k 



2 k E 



l-T<k 



.,{M k -r l F k f + 0(M k t~F k ) 



Applying Proposition [2] 8(M k t — F k ) a.s. tends to a finite limit as t — > 00. Hence by 
dominated convergence t~ p 2 k E [Njj t _ F (X k )lT<k\ has a finite limit as t — > 00. We 



deduce from this fact, (14) and (115]) that 



]bnsupt~ p E[N t (x)]-]imMr p E[N t (x)} < e. 

Since that inequality holds for every e > 0, t~ p E[N t (x)} has a finite limit as t — » 00 
which we denote by n^x): 



,(x) = lim t~ p E[N t (x)]. 
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5. Identifying the limit 



In this section, we show that x i-> n M (i) is proportional to i 4 (x(l — x)) 13 I" 1 
using a fixed point argument for integral equation (see also [U Section 4.1] for a similar 
application) . The normalizing constant will come from the L^-norm of x >-4> (x(l — 
x)Y I 2 and the constant of Proposition [2] 

Combining |2]) with the densities computed in Q, we deduce that 

' rl dy 



t- "E[N t (x)] = r fl *¥{t>T 1 ) + 2 



dy 

1-2/ 



y Jo 

dmmP E 



dmm fr E\(mt)-P* N m(t „ Tl) (y) 
(mt)- pt N m[t _ Tl) (y) 



Thanks to Lemma [2j we get by dominated convergence 



P* +1 

In other words, if we define 
2 



dy- 



9x{y) = 
we have 



1 



y 



1 



1 

/3*+2 



«oo(y) + (1 - x) 



/3* + l 



^i a <j/<i + 0--zy 



+1 



d V (1 _ y)/J . +2 »oo(w) 



1 



,,/3*+2 



(i-y) 



^00 (sc) = 



dyg x {y)noo{y). 



Let G be the operator that maps a function / s L 1 [0, 1] to the function 



G(/)(x) 



dyg x {y)f{y)- 



In particular, fixed point of G. It is easy to check that x £ (0, 1) 1— ► (feO) € 

L^O, 1] is continuous for the L^-norm. Furthermore, Lemma [2] ensures that In^x)! < 
G for every x € (0,1). As a consequence, x H> n 00 (x) is continuous over (0,1). An 
easy computation shows that for every y £ (0, 1), dxg x (y) = 1. Let p be another 
fixed point of G having the same integral as n^. Then 



dx|n 00 (a;) - p(x)\ 



dx 



dygxivKrioo -p)(y) 



< 



dx / dyg x (y) \ (y) - p(y) \ 



dy\ n °o(y) - p(y)\, 
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which shows that the inequality is in fact an equality. Hence —p has a.e. a constant 
sign. As we know that the integral of — p is zero, we deduce that = p a.e. 
Straightforward calculations prove that po : x M> [x(l — x)Y ^ 2 is also a fixed point of 
G of L 1 -norm, so that 



MWMlr'Wi-z)) 



a.e. 



Since rioo and po are continuous, we can remove the a.e. statement (observe that 
noo (0) = ^(l) = by Theorem Plainly, 



i|po||i 



r 2 ( £ + 1 



r(/?* + 2) • 

On the other hand, (|10[) and the dominated convergence theorem ensure that ||noo||i 



lim^oo t & E[iV t (/7)], which was computed in Proposition^ 

r(2(/3* + 1)) 



loo 1 = 



2r 3 (/3* + 1) ' 

Proof of Theorem^ To sum up, we have for every x £ [0, 1]: 

t^oo 2 r 3 (/3* + i)r 2 + 1) 

Applying Lemma [l] Theorem [l] is shown. 



6. Extensions and comments 

6.1. Various convergences 

In this paper, we only proved a convergence in mean of t-P"N t (x). We may wonder 
whether this quantity also converges in distribution, in probability, or even almost 
surely. A more interesting question is the following: does the process ((t~" N t (x)) x£ [o,i] 1 1 > 
0) converge in distribution in the Skorokhod sense to a random function (C(aO)a;e[o,i] 
as t — > 00? Observe that if it does, then there exists a random point U uniformly 
distributed over (0, 1) such that C(U) = 0, U corresponding to the point x\ of the first 
atom of II (Nt(xi) is indeed of order t^ 2 ^ 1 by Theorem [2]) . 

Conjecture 1. We have the functional limit law (t~^ Nt(x)) X £[o,i) ~^ {C{x)) x e[o,i\ as 
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t — > oo in (D([0, 1]), || • | |oo); where C satisfies the distributional fixed point equation 
(C(*)). eM = (W„ {(C/oC/i) r C< M > (£) + (C/o(l - U,)f (£) } 

+u{((i-Wf'c (I0! (^) 

+ ((1 -^ (1 -^ rc(11) (^)}), [0 v 

where Uq, U\, C^°\ C^ 01 \ C^ 10 \ C^ 11 -* are independent, Uq and U\ are uniformly 
distributed on [0,1] and C^, C {m \ C {w \ C {11 ^ have all the same distribution as C. 

6.2. Multidimensional case 

The strategy adopted in Section |3.1| may be generalized to higher dimensions. As 
for the convergence in mean of the number of hyper-rectangles crossed by a fixed affinc 
subspace having a direction generated by some vectors of the canonical basis, our 
approach may also be followed. 

6.3. Quadtree as a model of random geometry 

On top of its numerous applications in theoritical computer science, the model of 
random quadtree may be considered as a model of random geometry. More precisely 
one can view, for t > 0, the set of rectangles Q(t) as a random graph, assigning length 
1 to each edge of the rectangles. We denote this graph by Q(t). A natural question 
would be to understand the metric behavior of Q(t) as t — > oo? If L t is the graph 
distance in Q(i) between the up-left and up-right corners, then Theorem [2] already 
shows that L t is less than the order t^^ 1 . 

Problem 1. Is (t 1 ^^ L t ) , >Q tight? Does it converge in distribution? If not, find the 
right power of t. 
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